Dual Compositional Learning in Interactive Image Retrieval

نویسندگان

چکیده

We present an approach named Dual Composition Network (DCNet) for interactive image retrieval that searches the best target a natural language query and reference image. To accomplish this task, existing methods have focused on learning composite representation of text to be as close embedding possible. refer Network. In work, we propose loop with Correction models difference between in space matches it query. That is, consider two cyclic directional mappings triplets (reference image, query, image) by using both also joint training loss can further improve robustness multimodal learning. evaluate proposed model three benchmark datasets retrieval: Fashion-IQ, Shoes, Fashion200K. Our experiments show our DCNet achieves new state-of-the-art performance all datasets, addition consistently improves multiple are solely based Moreover, ensemble won first place Fashion-IQ 2020 challenge held CVPR workshop.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interactive learning and probabilistic retrieval in remote sensing image archives

We present a concept of interactive learning and probabilistic retrieval of user-specific cover types in a content-based remote sensing image archive. A cover type is incrementally defined via user-provided positive and negative examples. From these examples, we infer probabilities of the Bayesian network that link the user interests to a pre-extracted content index. Due to the stochastic natur...

متن کامل

Interactive Facial Image Retrieval

Interactive image retrieval is a powerful tool for image queries without input examples. For example, it is useful when searching a photo of a specified criminal only through the recalling of the witness. Unlike traditional text search engine working on input strings or keywords, a system which supports image query is usually required to learn opinions from users by relevance feedback and the r...

متن کامل

Interactive Semantic Image Retrieval

The big challenge in current content-based image retrieval systems is to reduce the semantic gap between the low level-features and high-level concepts. In this paper, we have proposed a novel framework for efficient image retrieval to improve the retrieval results significantly as a means to addressing this problem. In our proposed method, we first extracted a strong set of image features by u...

متن کامل

Learning Query-Dependent Distance Metrics for Interactive Image Retrieval

An approach to target-based image retrieval is described based on on-line rank-based learning. User feedback obtained via interaction with 2D image layouts provides qualitative constraints that are used to adapt distance metrics for retrieval. The user can change the query during a search session in order to speed up the retrieval process. An empirical comparison of online learning methods incl...

متن کامل

Discriminative learning with application to interactive facial image retrieval

The amount of digital images is growing drastically and advanced tools for searching in large image collections are therefore becoming urgently needed. Contentbased image retrieval is advantageous for such a task in terms of automatic feature extraction and indexing without human labor and subjectivity in image annotations. The semantic gap between high-level semantics and low-level visual feat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i2.16271